Proposal of an Exploitation-oriented Learning Method on Multiple Rewards and Penalties Environments and the Design Guideline

نویسنده

  • Kazuteru Miyazaki
چکیده

Among machine-learning approaches, reinforcement learning (RL) focuses most on goal-directed learning from interaction. Despite important applications, RL is difficult to design to fit real-world problems because, first, interaction requires too many trial-and-error searches and, second, no guidelines exist on how to design reward and penalty signal values. We are interested in approaches treating reward and penalty signals independently and not assigning them values. We also want to reduce the number of trial-and-error searches by strongly enhancing successful experience — a process known as exploitationoriented learning (XoL). Though there are many XoL methods, they cannot apply to multiple rewards and penalties environments adequately. In this paper, we propose a new XoL method that can treat multiple rewards and penalties effectively. We present simulation and experimental results to show the effectiveness of our proposal. Furthermore, we describe the design guideline about rewards and penalties for the XoL methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Designing collaborative learning model in online learning environments

Introduction: Most online learning environments are challenging for the design of collaborative learning activities to achieve high-level learning skills. Therefore, the purpose of this study was to design and validate a model for collaborative learning in online learning environments. Methods: The research method used in this study was a mixed method, including qualitative content analysis and...

متن کامل

e-learning Utilization Based on the Problem-Solving Approach

Introduction & Objective: Paying attention to the process and approaches to the problem solving from the view of the e-learning courses designers, will improve the aspects of development. The problem-based learning provides the discovery structure and helps the students to internalize their learning. Therefore, the purpose of this study is to investigate the factors that lead to more utili...

متن کامل

Effective Environmental Factors on Designing Productive Learning Environments

Educational spaces play an important role in enhancing learning productivity levels of society people as the most important places to human train. Considering the cost, time and energy spending on these spaces, trying to design efficient and optimized environment is a necessity. Achieving efficient environments requires changing environmental criteria so that they can have a positive impact on ...

متن کامل

The Effect of Metacognition Instruction in Multimedia-based Learning Environments on Nursing Students’ Spiritual Health

Background: One of the main competencies required for enabling Nursing students to provide effective clinical care is spiritual health. The growth and development of nursing students’ spiritual health rely on strengthening their cognitive and metacognitive components. What is more associated with spirituality and spiritual health is students’ metacognition. This study aimed to investigate the e...

متن کامل

Design and Validation of an Instructional Design Model for Reflection-Based Learning Environments 

Design and Validation of an Instructional Design Model for Reflection-Based Learning Environments   E. Azimi, Ph.D.* J. Haatami, Ph.D.** H. FarDaanesh, Ph.D.*** O. Noroozi, Ph.D.****   Reflection on teaching is a known method of learning to teach. Reflection is a form of thinking wherein improvement is sought through self-observation. Recent approaches to teaching practicums have gravi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JCP

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2013